Coupled dynamics of RNA folding and nanopore translocation 



in : 
o . 
o ■ 

<N ; 

Cu- 
te 1 

m : 
o\ : 



d 



(N 
> 

O 
O 
oo 
O 
in 
o 



i , 



X 



Ralf Bundschuh 1 and Ulrich Gerland 2 
1 Department of Physics, The Ohio State University, Columbus, Ohio 
2 Arnold- Sommerf eld Center for Theoretical Physics and Center for Nanoscience (CeNS), 
LMU Munchen, Theresienstrasse 37, 80333 Munchen, Germany 
(Dated: February 9, 2008) 

The translocation of structured RNA or DNA molecules through narrow pores necessitates the 
opening of all base pairs. Here, we study the interplay between the dynamics of translocation and 
base-pairing theoretically, using kinetic Monte Carlo simulations and analytical methods. We find 
that the transient formation of basepairs that do not occur in the ground state can significantly 
speed up translocation. 



The dynamics of base-pairing in DNA and RNA 
molecules plays an important role in many biological pro- 
cesses ranging from DNA replication to RNA folding Q . 
Often, the dynamics is coupled to other kinetic processes. 
For instance, RNA folds as it is synthesized, while it 
unfolds and refolds as it passes through the ribosome 
during translation. A series of recent single-molecule 
experiments Q used electric fields to drive RNA and 
DNA through tiny pores, which let single but not double 
strands pass. The case of unstructured, e.g. homopoly- 
meric molecules, is particularly well characterized, with 
many theoretical studies complementing the experiments 
fcJELOg. In contrast, the effect of base-pairing on translo- 
cation, see Fig. ^ is only beginning to be explored, with 
a few existing experiments [a, BLH and first steps to- 
wards a theoretical description [TjOi- In this letter, we 
specifically examine the interplay between base-pairing 
and translocation dynamics. 

Already homopolymer translocation displays rich dy- 
namics. Experimentally, one can obtain the full distribu- 
tion p(r) of the times r for the translocation of a molecule 
from the cis to the trans side of the pore. Hence, the goal 
of theoretical descriptions is to determine p(r) or its mo- 
ments as a function of system parameters such as the 
chain length N, the voltage drop across the pore, and 
temperature. Most treatments reduce the dynamics to a 
one-dimensional reaction coordinate, m, which measures 
how many bases have reached the trans side 0, Q . This 
approach is justified only when the polymer degrees of 
freedom equilibrate rapidly compared to the timescale of 
translocation. It fails for long chains, where the inter- 
nal polymer dynamics and friction with the solvent limit 
the translocation speed However, most experiments 
so far were performed with short chains where the fric- 
tion between the polymer and the pore dominates the 
translocation dynamics 0, 0|- In a minimal model for 
these experiments the index m is increased with a for- 
ward rate k + and decreased with a backward rate 
When expressed in terms of a continuous reaction coor- 
dinate < x < N, the drift-diffusion equation 



tion for x, where D is an effective diffusion constant and 
the drift velocity v is induced by the applied voltage. 
The ratio £ = D/v defines a diffusive lengthscale, below 
which the drift v is negligible compared to the Brownian 
motion From QJ, one obtains p(r) as the probability 
current into the absorbing boundary at x = N. 

Base-pairing interactions affect the translocation 
dynamics significantly by introducing sequence- and 
structure-dependent kinetic barriers Q • Previous 
theoretical work argues that these barriers can be ex- 
ploited to measure the secondary structure of an RNA 
molecule, if its sequence is already known. However, 
the interplay of the translocation and base-pairing dy- 
namics also poses new physical questions, which will be 
important for the design of applications. For instance, 
one expects two limiting cases for the dynamics: slow 
translocation, during which the base-pairing pattern re- 
mains equilibrated at almost all times and on both sides 
of the pore, and fast translocation, where an essentially 
frozen base-pairing pattern on the cis side is unzipped 
as the molecule passes through the pore. Ref. consid- 
ers only the latter limit. Here, we are interested in the 
crossover from the slow to the fast translocation limit 
and address the following questions: (i) Can the physics 
of the crossover be described in simple terms? (ii) What 
is the nature and size of the effective free energy barri- 
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describes the time evolution of the probability distribu- 



FIG. 1: Translocation of a structured RNA molecule through 
a narrow pore, which allows single but not double strands to 
pass. The driving force is an applied voltage, which acts on 
the negatively charged backbone of the RNA. An appropriate 
reaction coordinate is the number of bases m(t) that have 
reached the trans side. The dynamics of m(t) is coupled to the 
dynamics of the base-pairing patterns S C i S (t) and Strans(t) on 
the cis and trans sides. 
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ers for translocation? (iii) How is the distribution p(r) 
affected by the presence of secondary structure ? 

Whether an experiment is closer to the slow or fast 
translocation limit can depend on the sequence/structure 
of the molecule under study, the bias voltage, and the 
friction coefficient between the polymer and the pore. 
The latter determines the rate fco for translocation by a 
single (unpaired) base at zero bias, which we use as a 
control parameter for the crossover. Microscopically, fco 
is determined by the diameter of the pore and its surface 
chemistry, both of which are in principle tunable, cither 
by mutation in the case of protein pores or through the 
fabrication process in the case of solid-state pores 
We will see below that the pore depth is also a relevant 
parameter for the questions raised above. 

Model. — We focus on the case of RNA, even though 
our results apply equally to single-stranded DNA. Since 
kinetic barriers slow the translocation of structured 
RNA, we expect the description by the reaction coordi- 
nate m to be valid over an even wider regime of lengths 
N than for unstructured RNA. However, for structured 
RNA, the dynamics of m(t) is coupled to the dynamics of 
the RNA base-pairing patterns S C is (t) , Strans (t) on both 
sides of the pore. In our model, illustrated in Fig.^ only 
unpaired bases can enter the pore. We describe the dy- 
namics of S C is(t) and St ra ns(t) using Monte Carlo kinetics 
with three elementary moves: opening of a pair, closing, 
or a shift in the binding partner of a base [12j . We use 
a single rate fcb P for all moves that are energetically fa- 
vorable, and the rate fcb p e~ AG , if the move increases the 
Gibbs free energy by AG (all energies are in units of the 
thermal energy fegT). While this assumption does cer- 
tainly not hold on the microscopic level, we expect that it 
will not affect the qualitative features of the dynamics on 
long timescales. Indeed, a recent model of force-induced 
RNA unfolding ^jjj has shown a remarkably good agree- 
ment with experiment using similar assumptions. 

To calculate the Gibbs free energies we use a simplified 
energy model, which allows for CG, AU, and GU base- 
pairs with a binding energy of 2, 1, and 0, respectively. 
The free energy cost for any loop (interior, bulge, mul- 
tiloop, or hairpin) of size I is 3vhi(£), where v ~ 0.6 
is the Flory exponent. Wc exclude hairpin loops of size 
I < 3, which cannot occur in RNA structures due to 
steric constraints. The applied voltage drops primar- 
ily directly across the pore, leading to an energy gain 
AU when a monomer traverses the pore. Thus, the ra- 
tio of the forward and backward rates is given by the 
Boltzmann factor e AU . We assume the symmetric choice 
k± = fc e ±AU / 2 corresponding to a centered transition 
state along the microscopic reaction pathway for translo- 
cation by one monomer. We denote by h the number of 
bases that fit inside the pore and assume these cannot 
base-pair at all. While translocation consists of an en- 
trance stage followed by a passage stage, we focus only on 
the passage dynamics. Hence, we choose m(0) = as ini- 




FIG. 2: Mean translocation time (r) for RNAs with the same 
length TV = 50, but different structures (we plot (t) in units 
of fco^ 1 , which is 0(fj,s) in typical experiments pj) ■ Sym- 
bols represent simulation data, while dashed lines show the 
adiabatic limit and solid lines the expression for the 
two random walker model, (a) Crossover from slow to fast 
translocation at a fixed bias AU = 0.5 ksT. Typical base- 
pairing rates are fcbp ~ 1 /^s _1 [r^|. so that typical experiments 
would fall into the crossover region, (b) Dependence on the 
bias at a fixed ratio fco /fcbp = 1. 

tial condition and stop the simulation when m(r) = N, 
whereas we do not allow the RNA to exit on the cis side. 

Mean translocation times. — To explore the effect of 
base-pairing on the translocation dynamics, we use three 
different sequences with identical length N = 50; one 
wcll-dehncd structure (a hairpin with 23 random base 
pairs and a loop of 4 bases), one random sequence, and an 
unstructured homopolymer for comparison. Fig.|2]shows 
their mean translocation times (r) for different combina- 
tions of the parameters fco, fcbp, an( i AC/ (here we have 
used an idealized pore with h = 1). The average is taken 
over 1000 independent simulations, and (r) is plotted in 
units of the 'hopping time' ko _1 for an unpaired base 
across the pore at zero bias. 

Fig. displays the transition from the slow to the fast 
translocation limit by varying the ratio of the two char- 
acteristic timescales in our model, fco/fcbp, at constant 
bias AU = 0.5. For the homopolymer, the base-pairing 
timescale is irrelevant and hence ko{r) is constant. For 
the structured RNAs, the curves also approach a con- 
stant limit at small fco/^bp, indicating that translocation 
is sufficiently slow to allow equilibration of the basepair- 
ing pattern whenever the RNA translocates by a base. 
Note that this limit value of fco(r) depends strongly on 
the sequence/structure of the RNA. In the other extreme, 
ko 3> fcbp, base-pair opening in front of the pore is the 
rate limiting process. The crossover between these lim- 
its occurs at ko ~ fcbp- We will now seek a quantitative 
understanding of these simulation results. 

Slow translocation limit. — In the limit fco <C fcbp, 
the translocation dynamics m(t) becomes independent 
of the base-pairing kinetics. The effect of base-pairing 
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then amounts to reducing the translocation rates fc+, 
k- by factors equal to the probability that the base 
entering the pore is unpaired. Such equilibrium prop- 
erties of the base-pairing interaction can be expressed 
through the partition function Zij, the total statisti- 
cal weight of all secondary structures for the substrand 
from base i to base j of the RNA sequence. For in- 
stance, the probability for the j-th base on the trans 
side to be unbound is Zij-i/Zij. Hence, the translo- 
cation dynamics m(t) becomes a ID random walk with 
site-dependent hopping rates fc_(m) = k-Z\ m ^.\jZ\^ m 
and k + (m) = k + Z m+h+2 .N /Z m + h +i . N . Using the mean 
first passage time formalism [14j , we then find the mean 
translocation time 
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where T h (m) = - \xt(Z\ tm Z m +h+x,N / Z\,n) is the free 
energy cost of placing a free RNA from solution into po- 
sition to of a pore with depth h. We obtain the partition 
functions Zj , by generalizing the well-known recursion 
relations of [l!| to a logarithmic multiloop cost ^(|, in 
order to derive the exact adiabatic limit of our model. 

In Fig. 01, the dashed lines show that our kinetic sim- 
ulations indeed converge to the adiabatic limit (J2J) for 
small ko/kbp- However, besides providing a check on 
our simulations, Eq. also helps to understand the na- 
ture of the relevant free energy barriers. For example, 
the hairpin structure has a total binding free energy of 
w 33 kgT. Yet, its translocation time compared to that 
of the homopolymer is not more than 1000 times longer 
(at AU = 0.5 k B T and k < k bp ). The applied voltage 
lowers the free energy barrier only about 10 fc^T and does 
not suffice to explain the comparatively small effect of the 
stable structure on the translocation time. Inspection of 
the free energy landscape J-\{m) of the hairpin, shown 
in the inset of Fig.^, reveals that the formation of non- 
native base-pairs on both sides of the pore lowers T\ (m) 
considerably in the central part, and explains the effect 
(for comparison, the landscape for only the native hairpin 
structure is shown with a dashed line). Hence, the major 
barrier for the translocation of structured molecules is 
not unfolding of the complete structure, but partial un- 
folding of the structure until the formation of non-native 
basepairs speeds up translocation. 

Crossover to fast translocation. — In Fig. the 
crossover between slow and fast transition occurs in the 
range 0.1 < ko/kh P < 10 for both structured RNA se- 
quences. Incidentally, typical experimental values for the 
base-pairing and translocation rates are comparable with 
~ k ~ 1 lis' Hence it is important to charac- 

terize the translocation behavior in the transition region. 

Note that while the higher stability of the hairpin 
yields a longer translocation time, the shape of the promi- 
nent bend in the curves of Fig. [SJi is independent of the 



sequence as long as it is structured. This observation 
encourages us to develop a simplistic model: The domi- 
nant effect giving rise to the prominent bend is a kinetic 
competition between the opening/closing dynamics of the 
base pair next to the cis side of the pore, and the posi- 
tion of the pore with respect to the RNA, which is driven 
into the structure by the external bias. This process re- 
sembles two impenetrable random walkers biased to run 
into each other. Walker 'P' representing the pore has 
a hopping rate k + towards walker 'R' representing the 
RNA and a rate fc_ away from 'R'. Walker 'R', in turn, 
moves towards 'P' at the rate k^ p for base pair closing, 
and away at an unbinding rate k u that is reduced by the 
stability of the base pair. If none of these rates depends 
on the positions of 'R' and 'P', one finds that the cen- 
ter of mass position of the walkers drifts with a velocity 
v = (k u k + — fcb p fc_)/(/cbp + Assuming this drift 

dominates over diffusion, (r) takes the simple form 



N f fco 
r )=N v = T (a + b-± 
k \ k hp 



(3) 



with dimensionless constants a and b. The solid lines 
in Fig. 0i show Eq. (J3J, where we determined a from 
the slow translocation limit and used b as fitting pa- 
rameter. The agreement with the simulation data sug- 
gests that the shape of the curves is indeed largely de- 
termined by a kinetic competition between translocation 
and the local base-pairing dynamics in the vicinity of the 
pore. The effects of large-scale rearrangements in the 
base-pairing pattern during translocation are too subtle 
to be discernible in Fig. 01. However, reformation of 
base-pairs on the trans side, which we have ignored to 
obtain Eq. J2J, strongly influences the parameter b. 

Voltage dependence. — In Fig. ^jp the mean transloca- 
tion time of our three test sequences is plotted against 
the bias AU, where symbols represent simulation data at 
kbp = ko and dashed lines the adiabatic limit (0 . For the 
homopolymer, the adiabatic limit is exact and displays 
the well-known crossover from a diffusion-dominated 
(t) ~ N 2 /k at AU < 1/JV, to a drift-dominated 
(t) ~ N/k + at AU > 1/JV. For the structured RNAs, 
the mean translocation time also levels off at AU ~ l/N, 
since the molecule "feels" the bias only if its total free en- 
ergy differs by at least one k B T between the two sides of 
the pore. In this case, however, the simulation data lies 
somewhat above the adiabatic limit iffll. consistent with 
the fact that fcb p = &o is within the crossover regime of 
Fig. Et- Clearly, as we leave the adiabatic limit, the ef- 
fective kinetic barriers for translocation increase and iffll 
gives only a lower bound on the mean translocation time. 
As AU is increased to ~ k B T (right edge of the plot), 
the system is more strongly perturbed and the deviation 
from the adiabatic limit becomes more significant. 

Translocation time distributions. — In Fig. [2t>, the 
translocation dynamics of the homopolymer is drift- 
dominated for AU > 1/N. Is this equally true for the 
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FIG. 3: Translocation time distributions for (a) the RNA 
hairpin and (b) the random sequence at AU = 0.5 and ko = 
kb p . The histograms show simulation data, whereas the solid 
line in (a) is an exponential fit and in (b) a fit to the drift- 
diffusion form Q . The insets show the free energy landscapes 
Ti(m) of the adiabatic limit sec main text for details. 

translocation dynamics of the structured RNAs? To ad- 
dress this question, we examine the full distribution p(r), 
which displays more detailed signatures of the transloca- 
tion dynamics than the average (r) and is routinely mea- 
sured experimentally. The histograms in Figs. |2t an d 
[3Jd show p(r) for (a) the hairpin and (b) the random se- 
quence, from simulations at fco = fcbp and AU — 0.5 3> 
1/N. We observe that the distributions are markedly dif- 
ferent: For the hairpin, p{r) follows an exponential law 
(solid line) characteristic for thermally activated cross- 
ing of a single free energy barrier. In contrast, p(t) for 
the random sequence has the shape characteristic for the 
drift-diffusion process which is appropriate for un- 
structured polymers (the solid line in Fig. |2t> is a fit with 
the associated mean first passage time distribution [4J). 

Indeed, the free energy landscape .Fi(m) of the random 
sequence is remarkably flat (see inset of Fig. |3Jd) . This 
is due to the fact that the random sequence can accom- 
modate many dissimilar structures with similar free ener- 
gies: when the pore brakes up part of the structure of the 
random sequence, a (non-native) structure can reform on 
the trans side, which is almost equally stable. Hence, the 
translocation dynamics of the random sequence is qual- 
itatively similar to that of the homopolymer, only with 
a reduced diffusion coefficient D and drift velocity v due 
to the roughness of the free energy landscape. 

Conclusions. — Compared to the thermodynamics, the 
dynamics of base-pairing is far less understood. Translo- 
cation experiments are emerging as a valuable tool to 
probe these dynamics on the single-molecule level. Here, 
we have studied the interplay between translocation and 
base-pairing within a simplified, but explicit theoretical 
model. In the slow translocation limit, the transloca- 
tion dynamics of our model can be understood analyt- 
ically, which is useful to interpret our simulations even 
outside the slow translocation regime. We find that the 
formation of non-native basepairs can significantly speed 



up translocation, rendering the effective kinetic barriers 
much smaller than might be naively expected. Further- 
more, we showed that the crossover between slow and fast 
translocation can be described by a simple phenomeno- 
logical model. In the future, nanopores might be used to 
sort RNAs according to their structures. 
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